Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Conversation Quality Assessment
# Conversation Quality Assessment
Hh Rlhf Rm Open Llama 3b
A reward model trained based on the LMFlow framework. It is trained on the HH - RLHF dataset (only the useful part) with open_llama_3b as the base model and has good generalization ability.
Large Language Model
Transformers
H
weqweasdas
483
18
Featured Recommended AI Models
Empowering the Future, Your AI Solution Knowledge Base
English
简体中文
繁體中文
にほんご
© 2025
AIbase